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ABSTRACT 

•^-^After  considering  the  concept  of  effect  sparsity  as  a  justification  for 
the  use  of  unreplicated  fractional  factorial  designs,  we  discuss  the  situation 
where  factors  may  influence  not  only  the  location  but  also  the  dispersion  of 
the  data.  The  aliasing  of  location  and  dispersion  effects  is  explored  and 
methods  for  identifying  an  appropriate  location  -  dispersion  model  are 
considered. 
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SIGNIFICANCE  AND  EXPLANATION 


/ 

'-7*  Unreplicated  fractional  factorial  designs  are  frequently  employed  as 
screening  designs  when  it  is  believed  that  a  condition  of  effect  sparsity  will 

V  .  j , 

ensure  that  only  a  few  of  the  possible  effects  are  likely  to  be  large.  W 
Suppose  it  is  believed  that  only  an  (unidentified)  few  of  a  number  of 
candidate  design  variables  such  as  temperature  x^,  pressure  X2 ,  .  ..# 
speed  xy  affect  a  quality  characteristic  y  such  as  observed  tensile 
strength.  In  the  past  the  emphasis  has  been  on  determining  the  effect  of  such 
variables  on  the  mean  value  or  location  of  y  in  terms  of  the  usual  main 
effects  and  interactions  which  we  will  here  call  location  effects.  We 
consider  in  this  paper  the  possibility  of  also  determining  the  effect  of 
variables  on  the  variance  or  more  generally  the  dispersion  of  y.  We  call 
such  effects  dispersion  effects.  The  nature  of  the  alias  relationships 
between  location  and  dispersion  effects  is  discussed  and  a  method  developed 
for  identification  of  dispersion  effects  when  location  effects  are  also 
present.  An  example  is  used  to  illustrate  these  ideas. 
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analyzing  two- level  fractional  factorial  experiments 
FOR  POSSIBLE  DISPERSION  EFFECTS 

George  E.p.  Box  and  R,  Daniel  Meyer 

1 .  INTRODUCTION 

Table  1  shows  in  summary  a  highly  fractionated  two-level  factorial  design  enployed* 
as  a  screening  design  in  an  off-line  welding  experiment  performed  by  the  National  Railway 
Corporation  of  Japan  (Taguchi  and  Hu,  1980).  In  the  column  to  the  right  of  the  table  is 
shown  the  observed  tensile  strength  of  the  weld,  one  of  several  quality  characteristics 

measured. 

The  design  was  chosen  on  the  assumption  that  in  addition  to  main  effects  only  the 
two-factor  interactions  AC,  AG,  Ml,  and  GH  were  expected  to  be  present.  On  that 
supposition,  all  nine  main  affects  and  the  four  selected  two-factor  interactions  can  be 
separately  estimated  by  appropriate  orthogonal  contrasts  and  the  two  remaining  contrasts 
corresponding  to  the  columns  labelled  e^  and  e2  measure  only  experimental  error. 

Below  the  table  are  shown  the  grand  average,  the  fifteen  effect  contrasts,  and  the  effects 
plotted  on  a  dot  diagram.  When  the  effects  are  plotted  on  normal  probability  paper, 
thirteen  plot  roughly  as  a  straight  line  but  the  remaining  two,  corresponding  to  the  main 
effects  for  factors  B  and  C,  fall  markedly  off  the  line,  suggesting  that  over  the 
ranges  studied,  only  factors  B  and  C  affect  tensile  location  by  amounts  not  readily 
attributed  to  noise. 

If  this  conjecture  is  true,  then,  at  least  approximately,  the  sixteen  runs  could  be 
regarded  as  four  replications  of  a  22  factorial  design  in  factors  B  and  c  only. 
However  when  the  results  are  plotted  in  Figure  1  so  as  to  reflect  this,  inspection  sug¬ 
gests  the  existence  of  a  dramatic  effect  of  a  different  kind  -  when  factor  c  is  at  its 

*To  facilitate  later  discussion  we  have  set  out  the  design  and  labelled  the  levels 
somewhat  differently  from  Taguchi. 
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TABLE  1.  a  fractional  two-level  design  used  in  a  welding  experiment  showing  observed  tensile 
strength  and  effects.  Below  the  estimated  effects  are  plotted  as  a  dot  diagram. 


plus  level  the  spread  of  the  residuals  appears  such  larger*  than  when  It  is  at  Its  minus 
level.  Thus  In  addition  to  detecting  shifts  in  location  due  to  B  and  C,  the 
experiment  may  also  have  detected  what  we  will  call  a  dispersion  effect  due  to  C.  The 
example  raises  the  general  possibility  pursued  in  the  remainder  of  this  paper  of  analysing 
unreplicated  designs  for  dispersion  effects  as  well  as  for  the  store  usual  location 
effects. 
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Figure  1.  Tensile  data  as  four  replicates  of  a  2  factorial 
design  in  factors  B  and  C  only. 


*Data  of  this  kind  might  be  accounted  for  by  the  effect  of  one  or  more  variables  other 
than  B  that  affected  tensile  strength  only  at  the  "plus  level"  of  C  (only  when  the 
alternative  material  was  used).  Analysis  of  the  eight  runs  made  at  the  plus  level  of  C 
does  not  support  this  possibility,  however. 


2.  RATIONALE  FOR  USING  SCREENING  DESIGNS 

Before  proceeding  we  need  to  consider  the  question,  "In  what  situations  are  screening 
designs,  such  as  highly  fractionated  factorials,  useful?" 

2.1.  Effect  Sparsity 

A  common  industrial  problem  is  to  find  from  a  larger  number  of  factors  those  few  that 
are  responsible  for  large  effects.  The  ides  is  comparable  to  that  which  motivates  the  use 
in  quality  control  studies  of  the  "Pareto  diagram."  (See,  for  example,  Ishikawa  1976). 

The  situation  is  approximated  by  postulating  that  only  a  small  proportion  of  effects  will 
be  "active"  and  the  rest  "inert".  We  call  this  the  postulate  of  effect  sparsity.  For 
studying  such  situations,  highly  fractionated  designs  and  other  orthogonal  arrays  (Tippet 
(1934)  Finney  (1945),  Plackett  &  Bur man  (1946),  Rao  (1947),  Taguchi  and  Wu  (1980))  which 
can  screen  moderately  large  numbers  of  variables  in  rather  few  runs  are  of  great 
interest.  Two  main  rationalizations  have  been  suggested  for  the  use  of  these  designs; 
both  ideas  rely  on  the  postulate  of  effect  sparsity  but  in  somewhat  different  ways. 

2.2.  Rationale  Based  on  Prior  Selection  of  important  Interactions 

It  is  argued  (see  for  example  Davies,  1954)  that  in  some  circumstances  physical 
knowledge  of  the  process  will  make  only  a  few  interactions  likely  and  that  the  remainder 
may  be  assumed  negligible.  For  example,  in  the  welding  experiment  described  above  there 
were  36  possible  two-factor  interactions  between  the  nine  factors,  but  only  four  were 
regarded  as  likely,  leaving  32  such  interactions  assumed  negligible.  The  difficulty  with 
this  idea  is  that  in  many  applications  the  picking  out  of  a  few  "likely"  interactions  is 
difficult  if  not  impossible.  Indeed  the  investigator  might  justifiably  protest  that,  in 
the  circumstance  where  an  experiment  is  needed  to  determine  which  first  order  (main) 
effects  are  important,  it  is  illogical  that  he  be  expected  to  guess  in  advance  which 
effects  of  second  order  (interactions)  are  important. 

2.3.  Projective  Rationale  Factor  Sparsity 

A  slightly  different  notion  is  that  of  factor  sparsity.  Thus  suppose  that,  of  the 
k  factors  considered,  only  a  small  subset  of  unknown  size  d,  whose  identity  is  also 
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unknown,  will  he  active  in  providing  main  affects  and  lntaractiona  within  that  subsot. 
Arguing  as  in  Box  and  Huntor  ( 1961 )  a  two  lsvsl  dasign  snabling  us  to  study  such  a  system 
is  a  fraction  of  resolution  R  -  d  +  1  (or  in  tha  terminology  of  Rao  (1947)  an  array  of 
strength  d)  which  produces  ccmpleta  factorials  (possibly  replicated)  in  every  one  of  the 
(£)  spaces  of  d  ■  R  -  1  dimensions.  For  example,  we  have  seen  that  on  the  assumption 
that  only  factors  B  and  C  are  Important,  the  welding  design  could  be  regarded  as  four 
replicates  of  a  22  factorial  in  just  those  two  factors.  But  because  the  design  is  of 
resolution  R  «  3  the  same  would  have  been  true  for  any  of  the  36  choices  of  two  out  of 
the  nine  factors  tested.  Thus  the  design  would  be  appropriate  if  it  were  believed  that 
not  more  than  two  of  the  factors  were  likely  to  be  "active.” 

For  further  illustration  we  consider  again  the  sixteen-run  orthogonal  array  of  Table 
1.  In  Table  2  adopt  a  roman  subscript  to  denote  the  resolution  R  of  the  design,  which 
associates  factors  with  the  dotted  columns  in  the  manner  shown. 


Columns 

1  2  3 

4  5  6  7  8  9  10 

11  12 

13  14  15 

(a) 

,15-11 

III 

•  •  * 

•  • 

•  •  • 

(b) 

28-4 

IV 

•  • 

•  •  • 

* 

•  • 

(c) 

25"1 

V 

•  • 

•  • 

• 

(d) 

24 

•  • 

•  • 

TABLE 

2.  Some 

alternative 

uses  of  the  orthogonal 

array  of 

Table  1 . 

It  will  be  seen  thati 

(a)  if  we  associated  the  fifteen  contrast  columns  of  the  design  with  fifteen 
factors,  we  would  generate  a  2jxi^  design  providing  four-fold  replication  of  22 
factorials  in  every  one  of  the  105  two-dimensional  projections. 

(b)  if  we  associated  only  columns  1,  2,  4,  7,  8,  11,  13,  and  14  with  eight  factors 

8—4  i 

we  would  generate  a  2^v  design  providing  two-fold  replication  of  2J  factorials  in 


every  one  of  the  56  three-dimensional  projections 


(c)  if  we  associated  only  columns  1,  2,  4,  8,  and  15  with  five  factora  we  would 
generate  a  2®”*  design  providing  a  24  factorial  in  every  one  of  the  four-dimensional 
projections. 

(d)  if  we  associate  only  columns  1,  2,  4  and  8  with  four  factors  we  would  obtain  the 
complete  2*  design  from  which  this  orthogonal  array  was  in  fact  generated. 

Designs  (a),  (b)  a  (c)  would  thua  be  appropriate  for  eituations  where  we  believed 

respectively  that  not  more  than  2,  3,  or  4  factors  would  be  active*.  Notice  that 

intermediate  numbers  of  factora  could  be  accommodated  by  suitably  omitting  certain 

9-5 

columns.  Thus  the  welding  design  is  a  2^^  arrangement  which  can  be  obtained  by  omitting 

15-11 

6  columns  from  the  complete  2^^  .  Notice  finally  that  for  intermediate  designs  we  can 
take  advantage  of  both  rationales  by  arranging  as  was  done  for  the  welding  design,  that 
particular  interactions  are  isolated. 

2.4.  Clues  and  Leadsi  Not  Final  Conclusions 

In  the  past  some  misunderstanding  about  the  value  of  fractional  designs  has  occurred 
because  they  were  not  always  considered  in  an  appropriate  context.  For  instance,  while 
the  above  analysis  of  the  welding  data  screening  design  does  not  lead  to  unequivocal 
conclusions,  it  does  suggest  the  possibility  of  location  effects  in  B  and  C  and  a 
dispersion  effect  in  C  which  are  worth  checking  out.  Typically  the  experimenter  will  be 
following,  with  the  help  of  a  statistician,  an  iterative  path  of  investigation  which 
cannot  be  forecast  at  the  outset  and  he  must  behave  very  much  like  a  detective 
investigating  a  mystery.  He  proceeds  by  following  clues  leading  to  further  searchings 
(experiments)  which  provide  further  clues  and  so  on.  The  prime  objective  is  to  achieve 
reasonably  rapid  convergence  of  the  investigatory  process.  While  it  is  true  that  more 
formal  statistical  procedures  may  be  needed,  to  confirm  finally  that  the  investigation  has 

*The  designs  give  partial  coverage  for  a  larger  number  of  factors,  for  example 

(Box  4  Hunter  (1961))  56  of  the  70  four-dimensional  projections  of  the  28-4 
yield  a  full  factorial.  IV 


really  reached  the  objective  claimed,  this  later  confirmatory  stage  is  usually  only  a 
small  part  of  the  experimental  effort. 

A  discussion  of  the  iterative  model  building  process  by  Box  &  Jenkins  (1970) 
characterized  three  steps  in  the  iterative  data  analysis  cycle  indicated  below. 


' - *  identification  -  *  fitting  ■  -  ♦  diagnostic  checking' 

Host  of  the  present  paper  is  concerned  with  model  identification-~that  is  the  selection  of 
a  model  worthy  to  be  entertained  and  fitted  more  formally  by  an  efficient  process  such  as 
maximum  likelihood. 

The  situation  we  now  address,  therefore,  concerns  the  identification  of  factor 
effects  from  fractional  designs  in  the  circumstance  of  effect  sparsity,  where  dispersion 


effects  as  trail  as  location  effects  may  be  present 


3.  DISPERSION  EFFECTS 


He  again  use  the  design  of  Table  1  for  illustration.  There  are  16  runs  from  which  16 
quantities-- the  average  and  15  effect  contrasts— have  been  calculated.  Now  if  we  were 
also  interested  in  possible  dispersion  effects  we  could  also  calculate  15  variance 
ratios.  For  example,  in  column  1  we  can  compute  the  sample  variance  s  ( 1 — )  for  those 
observations  associated  with  a  minus  sign  and  compare  it  with  the  sample  variance 
s2(1+)  for  observations  associated  with  a  plus  sign,  to  provide  the  ratio 
Fi  ”  s2 ( 1-)/s2 ( 1+) .  if  this  is  done  for  the  welding  data  we  obtain  values  for  lnF^* 
given  in  Figure  2(a).  It  will  be  recalled  that  in  the  earlier  analysis  a  large  dispersion 
effect  associated  with  factor  C  (column  15)  appeared  to  be  present  but  in  Figure  2(a)  the 
effect  for  this  factor  is  not  especially  extreme,  instead  the  dispersion  effect  for  factor 
D  (column  1)  stands  out  from  all  the  rest.  This  misleading  indication  occurs  b»  t 

have  not  so  far  taken  account  of  the  aliasing  of  location  and  dispersion  efe  Since 

sixteen  linearly  Independent  location  effects  have  already  been  calculated  for  the 
original  data,  calculated  dispersion  effects  must  be  functions  of  these.  The  general 
nature  of  the  location-dispersion  aliasing  is  explained  in  the  section  that  follows.  For 
illustration  equation  (1)  shows  the  identity  that  exists  for  the  dispersion  effect  that  is 
the  F  ratio  associated  with  factor  D  and  hence  for  column  1  of  the  design. 

In  the  expression  i  is  used  to  indicate  the  usual  location  effect  (contrast) 


. 

.  .  2 

.  .  2 

A  A  2  A  A  2 

-  .  2 

„  „  2  „  .  2 

F  »  F  -  <2'3) 
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(4-5) 
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(6-7 )  +  (8-9) 

+  (10-11) 

+  (12-13)  +  (14-15) 
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.  -  2 

.  .  2  ,  .  2 

.  «  2 

.  *  2  .  „  2 
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(2+3) 
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+ 

(6+7)  +  (8+9) 

+  (10+11) 

+  (12+13)  +  (14+15) 

* 

A  A 

m 

Now  (see  Table  1) 

B 

-  14  - 

2. 

15  and  C  ■  15 

»  3.10  are 

the  two  largest  location 

effects,  standing  out  from  all  the  others.  The  extreme  value  of  FD  associated  with  an 


•In  this  figure  familiar  normal  theory  significance  levels  are  also  shown.  Obviously  the 
necessary  assumptions  are  not  satisfied  in  this  case,  but  these  percentages  provide  a 
rough  indication  of  magnitude. 
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apparent  dispersion  effect  for  factor  D  is  thus  largely  accounted  for  by  the  squared  sub 

A  A 

and  squared  difference  of  the  location  effects  B  and  C  which  appear  respectively  as 
the  last  terms  in  the  denominator  and  numerator  of  equation  1.  A  natural  way  to  proceed 
is  to  compute  variances  from  the  residuals  obtained  after  eliminating  large  location 
effects.  After  such  elimination  the  alias  relations  of  equation  1  remain  the  same  except 
that  location  effects  from  eliminated  variables  drop  out.  That  is>  zeros  are  substituted 
for  eliminated  variables.  Variance  analysis  for  residuals  after  eliminating  effects  of  B 
and  c  are  shown  in  Figure  2(b).  The  dispersion  effect  associated  with  C  (factor  15) 
is  now  correctly  indicated  as  extreme. 


4.  DISPERSION  AND  LOCATION  ALIASING 


4.1.  Identities  Existing  Between  i  it-perglon  and  Location  Effects 

In  order  to  study  the  identity  relations  existing  between  location  and  disperaon 
effects  consider  an  n  *  n  orthogonal  array  with  n  »  2^  columns  of  +1'a  and  -I'a 
labelled  , . . . •  Let  jj0  ■  I  be  a  column  of  tl's  and  the  remaining  columns 

delineate  the  usual  contrasts  for  the  main  effects  and  interactions  of  a  2*1  factorial 
design.  In  general  we  suppose  that  the  array  is  to  be  used  as  a  2k"P  fractional  or  full 
factorial  to  test  k  factors,  so  that  q  »  k  -  p  with  p  >  0. 

To  illustrate  ideas  we  will  employ  an  eight  run  orthogonal  array  although  in  practice 
this  would  often  be  too  small  a  design  to  allow  variance  effects  to  be  studied.  Setting 
q  “  3  the  columns  of  the  resulting  23  design  in  factors  A,  B,  and  C  can 
alternatively  be  labelled  with  numbers  or  letters  as  follows: 


0 

1 

2 

3 

4 

5 

6 

7 

I 

A 

B 

AB 

C 

AC 

BC 

ABC 

+1 

-1 

-1 

+1 

-1 

+1 

+1 

-1 

+1 

+1 

-1 

-1 

-t 

-1 

+1 

+  1 

+1 

-1 

+  1 

-1 

-1 

+  1 

-1 

+  1 

+1 

+  1 

+1 

+1 

-1 

-1 

-1 

-1 

+1 

-1 

-1 

+1 

+1 

-1 

-1 

+  1 

+1 

+1 

-1 

-1 

+1 

+1 

-1 

-1 

+1 

-1 

+1 

-1 

♦  1 

-1 

■M 

-1 

+1 

+  1 

+1 

+1 

+1 

+  1 

♦1 

+1 

As  is  well  known  the  array  may  be  used  as  a  full  factorial  or  as  a  fractional  design.  For 

example,  associating  three  factors  with  columns  1,  2,  4  reproduces  the  23  factorial, 

4-1 

four  factors  associated  with  columns  1,  2,  4,  7  produces  a  2JV  fractional,  seven 

7-4 

factors  associated  with  columns  1  through  7  produces  a  fractional. 

In  general  the  columns  JSg'Ki ' • •• 'Sn-1  form  a  group  closed  under  multiplication 

defined  such  that  product  column  has  for  its  u6*1  element  x^ju  « 

xiuxju  *u  “  1,*..,n).  Now  suppose  we  are  interested  in  contrasting  variances  at  the  lower 
and  upper  levels  of  factor  i.  Consider  the  elements  of  a  column  V2  (Xq  4  x^)(i  *  0 > * 
these  are 
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V2<x0u  *  Xiu) 


^<x0u  +  "in’ 


*'  if  xiu  ’  -1 


0  if  x.  -  +1 
iu 


0  if  x.  -  -1 
iu 


+  1  if  X.  -  +1 

iu 


Thus  the  elements  of  s  column  V»  ( x .  ±  x, . )  are 

*-ij 


^<Xju  •  xijuJ  ■1/^xju<xOu  ■  xiu> 


if  xiu-"1 


if  X .  -  +1 

iu 


^(xju  +  xiju)  “^xiu(XOu  +  *lJ 


X.  if  X.  -  +1 
„  ju  iu 


Returning  for  illustration  to  the  2 3  design,  if  for  example  we  wished  to  compare 
variances  at  the  lower  and  upper  level  of  factor  C  -  (i.e.  for  the  first  and  last 

four  observations)  then  setting  i  -  4  it  will  be  useful  to  generate  the  columns 
-  »4. j>  J  -  0,1,. ..,7  thus: 


J 

0 

4 

1 

5 

2 

6 

3 

7 

+1 

-1 

-1 

+1 

-1 

♦  1 

+1 

-1 

+  1 

-1 

+1 

-1 

-1 

+1 

-1 

+1 

+1 

-1 

-1 

+1 

+1 

-1 

-1 

+1 

V2  <i$j  -  *4.  j> 

♦  1 

-1 

+1 

-1 

♦1 

-1 

+1 

-1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Note  that  for  every  i 

the  columns 

% 

■  *i«j 

)  will  in 

general 

appear  in  j 

-  2' 

identical  (apart 

from 

sign)  pairs. 

Now 

suppose 

data  x  ' 

■  <y1 » •  • 

•'V-'V 

are 

available  and  let  j  •  x'£j  frost  which  the  estimated  effect  of  factor  j  may  be  obtained 
by  dividing  by  an  appropriate  constant.  Then  for  every  i  the  quantities 
X'*£j  ”  "  J  “  i*j  provide  an  exhaustive  set  of  n/2  linearly  independent 
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contrasts  of  thoss  n/2  observations  yu  for  which  xlu  -  -1.  Correspondingly,  tho 
columns  Xj  ♦  aitj  provide  a  similar  set  of  contrasts  for  the  remaining  observations  for 
which  xlu  -  +1,  Denote  by  S( i- )  and  S( i+ )  the  sums  of  squares  of  the  yu  for 
which  xiu  -  -1  and  +1  respectively.  Then 


8(1-)  I  tox'<* j-2 i.jJ  -  J  I 

B  J-0  313  n  j-0  2 


—»-$2 

J-0  J  J  J-0 

For  example ,  for  the  8x8  array  derived  from  the  23  factorial, 

2  2  2  2 

8(4-)  -  y^  +  y2  +  y3  ♦  y4 

*  *  5  *  *  j  *  *  3  *  A  3  *  *  3  **2  *  *  2  *  *  - 

-  i  [(*?)  ♦  (^)  -  (2?)  ♦  (¥)  +  (2f)  +  (¥)  +  (t)  ♦  l2?)  1 


a  *  3  as*  *  a  «  a  a  . 

i  1(2?-)  *  (2f)  *  (2?)  .  (2f)  ] 


i  [(H)J .  (iak)1 .  (i^)1 .  (ilk)1]  . 


4.2.  Blimlnatlon  of  Location  Effects 

The  sums  of  squares  in  (4),  (5)  would  be  appropriate  to  compute  dispersion  effects 
only  if  it  could  be  assumed  that  all  the  location  effects,  including  the  overall  mean, 
were  known  to  be  zero.  If  this  were  not  the  case  then  the  sums  of  squares  S(i-)  and 
8(i+)  could  be  inflated  by  location  effects.  Vo  remove  such  effects  we  can  replace  the 
yu's  in  (5)  by  residuals  yu  -  yu  obtained  after  eliminating  all  suspected  location 
effects  including  the  mean  by  least  squarss. 

Now  the  vector  of  residuals  is  orthogonal  to  each  column  vector  corresponding  to  an 
eliminated  variable.  It  follows  that  sums  of  squares  calculated  from  such  residuals  will 


■  -V-  ■ 

.  V -  V  *.  .-v;-.-:  •v  -.v.-v 


have  tha  same  fora  as  (4)  but  with  all  estiMted  affacta  which  correspond  to  aliainatad 
variablas  aat  equal  to  zero . 


Further  understanding  ia  gained  by  considering  tha  expected  values  of  8(1-)  and 
S(i+)  under  various  circumstances.  Suppose  a  difference  in  variance  night  exist 
associated  with  the  level  of  the  single  coluan  x^  and  the  subs  of  squares  S(i-)  and 
S( 1+)  are  computed  from  (4)  but  with  yu  replaced  by  residuals  after  a  number  of 

A 

location  effects  have  been  eliminated.  Then  after  setting  to  zero  all  the  elements  j 
and  i*j  in  (5)  which  correspond  to  eliminated  variables,  suppose  there  are  £  cases 
where  bracketed  pairs  (j,  i*j)  have  been  eliminated  and  m  cases  where  only  one  element 
of  a  bracketed  pair  has  been  eliminated  so  that  there  remains  ~  -  t  -  m  complete 


bracketed  pairs. 

Now  for  a  bracketed  pair 

«[f  {j  -  o2<l-)  (6) 

and  for  a  single  element 

'[£  tj  ' 7  •  (7) 

It  follows  that 

b[s<1-)]  •  (j  n  -  t  -  j  m)o2(i-)  ♦  i  sw2(i+)  (8) 

b[s(1+)]  ■  (jn-»-j  m)c2(i>)  ♦  j  mo2(i-)  .  (9) 

If  we  define 

s2(i-)  -  S(i-)/(i  n  -  i  -  j  a)  ,  (10) 

then 

*[e2 (1- ) ]  -  02(i-)  ♦  ;n--.  4T~-  ~2a  (°2<i+)  •  «2<1->)  (11) 


and  similarly  for  s2(i+)  with  the  roles  of  o2(i-)  and  o2(i+)  reversed. 

It  ehould  be  noted  that,  in  the  circumstance a  of  effect  sparsity  here  considered,  the 

2  2 

bias  term  in  (11)  involving  o  (1+)  -  0  (i-)  would  be  rather  small.  For  example,  suppose, 
with  a  design  having  n  -  16  runs,  that  £  ■  2  and  m  •=  1,  then  the  bias  term  will  be 


{o2(i+)  -  o2(i-)}/22.  it  seems  reasonable  to  conclude  that  for  purpoaaa  of  nodal 
identification  the  elimination  of  location  effecta  by  simply  taking  reaiduala  ia  unlikely 
to  uialead. 

4.3.  Illuatratlona  with  the  8x8  Array 

The  general  aituation  nay  be  better  underatood  by  conaidering  a  few  special  caaee 

again  uaing  for  illustration  the  8  x  8  factorial  array.  Setting  i  “  4  -  C,  suppose  we 

2  2 

wish  to  obtain  the  dispersion  effect  a  (4-)/a  (4+)  which  contrasts  the  variances  of  the 
first  four  and  last  four  observations. 

Elimination  of  Grand  Mean 

A 

Elimination  of  the  mean  which  would  usually  be  unknown  results  in  the  removal  of  0 
in  equations  (4).  For  the  8*8  array i  n  *  8,  i  -  0,  a  «  1 

,2(4-)  -  (_i  ((4)2  +  O  -  5)2  ♦  (2  -  6>2  +  (3  -  7>2)}  /(7/2> 

16 

and  using  (11) 

e[s2(4-)]  -  o2(4-)  [o2(4t)  -  <j2(4->] 

A 

The  slight  bias  in  the  variance  estimate  arises  because  the  isolated  effect  4  is  a 
function  of  all  eight  observations. 

A 

Elimination  of  the  Mean  and  Effect  4 

If  now  the  location  effect  associated  with  factor  4  is  eliminated  as  well  as  the 
overall  mean  then  a  complete  pair  ia  removed  in  (5)  and  in  this  example 

7  1rA*2  aaa  aa2i 

a  (4-)  -  -I  {(1  -  5 j  *  (2  -  6/  +  (3  -  7)*}  /3 

e[b2<4-)]  -  02<4-> 

No  bias  now  occurs  because  elimination  of  0  and  4  is  equivalent  to  eliminating  means 

2 

separately  from  the  first  four  and  the  last  four  observations,  and  s  (4-)  is  a  function 
of  only  the  first  four  observations.  Similar  effects  are  found  with  all  bracketted 


pairs.  Thus  if  we  eliminate  factor  2  and  the  interaction  2*4  *  6  the  bias  ter*  doee 
not  appear  because  allowance  is  being  nade  for  different  affects  of  factor  2  at  the  two 
levels  of  factor  4. 

4.3.  Dispersion  Interactions 

Since  more  than  one  dispersion  effect  might  be  present  we  need  to  consider  the 
possibility  of  interaction.  If  the  effect  of  changing  from  the  minus  level  to  the  plus 
level  of  a  factor  i  ia  to  multiply  the  variance  by  ^  irreapective  of  whether  the  plus 
or  minus  level  of  factor  j  ia  amployad  we  shall  say  that  there  is  no  dispersion 
interaction  between  i  and  j.  In  such  a  casa  the  variancee  for  the  various  factor 
combinations  are  as  follows 


+  q2(i-,J+)  -  i  o2  o2(i+,J+>  -  ♦i4ja2 


o2(i-,j->  “  O2 


o2U+,J->  -  ♦j®2 


Equivalently  for  the  logged  variancea  the  dispersion  effects  will  be  additive  and  in  this 
metric  dispersion  Interactions  of  all  orders  may  be  defined  in  the  usual  way.  It  shall  be 
noted  that  when  there  is  no  dispersion  interaction  the  ratio  of  the  average  variance  at 
the  plus  and  minus  levels  for  factor  i  is 

q2(l+,J-)  ♦  q2(l+,j+)  _  V1  *  V°  „  ^ 
q2(i~» j~)  ♦  q2(i-,j+)  (1  +  ♦jjq2  1 

and  similarly  for  factor  j  and  4 j .  Thus  even  when  there  is  more  than  one  dispersion 
effect  the  simple  analysis  described  above  could  still  be  of  value  as  a  preliminary 
analytical  device  for  indicating  which  factors  needed  further  study.  In  particular  if  two 
factors  1  and  j  appeared  to  exhibit  dispersion  effects,  then  further  analysis  would  be 
appropriate  to  consider  the  general  evidence  for  activity  of  these  effects  taking  account 
also  of  possible  interaction,  nils  could  be  done  by  considering  general  differences  among 


V 
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tha  sums  of  squares  associatad  with  tha  four  calla  S( 1— # j— ) ,  8(1-, j+),  S(i+,j-), 

S(i*«J+)  of  tha  two-way  table  for  tha  two  factora.  Aa  bafora  theae  aunt  a  of  aouarea  would 
be  calculated  from  reaiduala  after  eliminating  location  effects.  The  consequences  of 
doing  thia  ia  explored  in  the  Appendix  which  givea  a  matrix  generalization  of  earlier 
reaulta . 

2  2 

A  convenient  function  for  coaiperlng  a  net  of  variancea  sf,...,sk  having  Vj,...,vk 
degree*  of  freedom  reapectively  ia  Bartlett 'a  criterion, 

k  ,  k  ,  k 

M  »  W  in(w  l  v a*)  -  J  v .in  a*  ,  where  N  -  l  v  . 

t-1  z  t-1  t-1 

When,  aa  would  frequently  be  the  caae,  the  acreening  design  ia  of  only  moderate  aize  one 
could  not  expect  to  atudy  aimultaneoualy  a  large  number  of  factors  in  this  way.  For 
example,  for  n  -  16,  tha  individual  calla  from  which  B(i-,j-)  8(i-,J+),  etc.  would  be 
calculated  will  each  contain  only  four  obaervations .  However  when,  in  circumstances  of 
effect  sparsity,  only  a  very  few  such  effecta  are  likely  to  be  of  appreciable  magnitude, 
the  above  analysis  could  be  of  value. 

We  again  illustrate  with  the  welding  data.  Figure  3(a)  shows  the  35  distinct  values 
of  M  computed  for  tha  data.  There  are  ( ^ )  “  105  ways  of  choosing  two  columns  from 
the  15  columns  of  the  design  but  these  are  aliased  in  sets  of  three  (any  coluarn  is  the 
product  of  two  other  columns).  Thus  the  largest  value  la  associated  with  columns  15  -  C, 
2  ■  H,  and  13  «  J.  This  effect  could  equally  well  be  attributed  to  factora  C  and  H 
with  interaction  CH  «  J  or  to  C  and  J  with  interaction  CJ  •  H  or  to  H  and  J 
with  Interaction  HJ  -  C.  It  is  noteworthy  that  the  seven  largest  values  of  M  which 
stand  out  from  the  rest*  all  include  factor  C  in  their  triplets.  Also  if  the  dispersion 
effect  of  C  ia  eliminated  by  rescaling  the  residuals  the  plot  (Fig.  3(b))  no  longer 


shows  outstanding  pointa 


5.  MAXIMUM  LIKELIHOOD  ESTIMATES  OF  LOCATION  AND  DISPERSION  EFFECTS 


Once  a  model  has  been  Identified  a  more  precise  fitting  is  possible  using  maximum 
likelihood.  Hartley  and  Jayatillake  (1973)  have  shown  that  the  following  method  will  give 
convergence  to  a  stationary  point  of  the  likelihood.  Conditional  on  the  dispersion 
effects,  location  effects  may  be  obtained  by  weighted  least  squares;  the  dispersion 
effects  may  now  be  recomputed  from  the  residuals  and  the  iteration  continued  until 
cover gence  is  achieved.  It  is  often  convenient  to  assume  initially  that  there  are  no 
dispersion  effects. 

For  illustration  the  following  table  shows  maximum  likelihood  estimates  for  the 
welding  data  assuming  location  effects  for  B  and  C  and  a  dispersion  effect  for  C. 

The  earlier  approximate  estimates  are  indicated  for  comparison. 

o  Z(C+) 

A  A  A  A  A  A  A  A 

V  B  CO  Z(C+)  O  Z(C->  O  *(C-) 

Maximum  likelihood  estimates  42.96  2.04  3.10  ,469  .021  22.3 

Earlier  approximate  estimates  43.00  2.15  3.10  .564  .031  IS. 2 

Table  3.  Estimates  of  location  and  dispersion  affects:  welding  data. 


Appendix 

The  results  of  section  4  can  be  generalised  using  matrix  algebra.  Again,  let  £  be 
a  matrix  of  *  1's  with  orthogonal  columns  jc0,...,xji_1  and  a  column  of  1's.  If  y 

A 

denotes  the  n  x  1  vector  of  observations,  we  define  j  -  x^j.  Suppose  we  wish  to 
compute  S(i-)  -  sum  of  squares  of  the  yu  at  the  minus  level  of  column  Jj1.  Without 
loss  of  generality,  we  assume  x^  ”  (-1  ,-1 , . . .  ,-1 , 1 , 1 , . . . ,  1 ) ' ,  and  let  ji  be  the  n  x  n 
identity  matrix.  Define  the  n  *  n  matrices  by 


I  0 

I  -  (  n/2  ~) 
*1  V  0  0J 


0  0 

~2  "  ^0  I  ^ 
~  ~n/2 


where  1^/2  18  the  2*2  i<Jentity  “«trlx. 


fti«n  S(l-)  can  b«  written 


S(l-)  -  • 

Noting  that  -  XX'  -  I  and  I,X  ■  7  [x  -  (I  -  21  jxl 
we  have 

S(i-)  1(5  -  <1  -  2i,)x)  'xl'Ux  -  <1  -  2X,)x)*Xj  . 

Now  observe  that  (J  — 2J_^  JJC  la  just  the  matrix  jC  with  every  column  multiplied  by 
Therefore , 

(X  -  <1  -ll^xj’y  -  (0-1*0,  1-1*1 . n-1  -  i*n-1)’ 

and 

.  n-1  .  “1 

sd-)  -  j-  la-  i*j> 

n  j-0 

as  was  shown  In  Section  4,  equation  (4).  Similarly,  we  can  write 

S(1+)  ’  in  (J^V* 

j-0 

As  was  also  shown  In  section  4,  eliminated  location  effects  will  drop  out  of  the  above 
expressions. 

2  2 

To  compute  the  expectation  of  S(i-),  let  O  (1-)  and  o  (1+)  be  the  variance  of 
yu  at  the  minus  and  plus  levels  of  X|,  and  let  the  matrix  g  be  those  columns  of  £ 
corresponding  to  legation  effects  included  in  the  model  (i.e.  eliminated  to  obtain 
residuals),  the  least  squares  estimates  of  location  effects  are  given  by  t  -  —  z’y. 
Then 


S(l-)  -  (y  -  ZT)  ‘I^ty  -  ZT) 


X'(l  ^  Zg’)x 


y*  (I,  -  -  zz’x,  -  -  i-zz’  +  ~  zz'i.zz’ )y 
*  ~1  n  —  ~1  n  ~  1 —  2  —  ~1 —  * 

n 
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kssuaing  E[yl  «  £1  *”d  using  the  identity 


Therefore 


K  ■  — y  p  +  21  . 

4 

and 

Efs(i-)]  -  (j  -  p  ♦  P  *-2-)a2(l-)  +  -  -2|-)o2(i+) 

-  (f  -  *  -  |  n]o2(i-  +  |  o2(i+) 

where  m  »  p  -  2t. 

To  extend  these  derivations  to  cosiputations  involving  two  variables,  define  matrices 
X3»  X4«  Xg  *• 


where  is  the  j  *  •j  identity  matrix.  Now  suppose  we  wish  to  compute  the  sum  of 

squares  of  the  yu  at  the  minus  levels  of  and  S(i-,j-).  without  loss  of 

generality,  assume  and  Xj  ”  (-1, . . .  ,-1|  1 , . . . ,  1|-1, . . . 


1 , . . . ,  1 ) .  Then 

S<i-,j->  - 

Now  use  the  facts  I  «  —  XX', 
n 

i3  “  7  U  -  U  -  2(i3  +  X4>)  -  U  -  2<i3  ♦  *  U  •  2<~4  + 


and 


Group  2:  Those  columns  such  that  only  one  of  2Sj*£)(<  £i*3j*-Sk 

is  in  Z  . 

Group  1:  Those  columns  not  in  previous  three  groups. 

Let  mk  =  number  of  columns  in  group  k»  note  that  m^  is  a  multiple  of  k.  Further 
subdivide  group  2  into  three  subsetB 

Group  2.1:  those  pairs  with  _g  ■  •8i*'5k 

Group  2.2:  those  pairs  with  z ^  m  “~v 

Group  2.12:  those  pairs  with  z^  “  ~i*~j*~k" 

Let  m2.k  be  ^he  number  of  columns  in  group  2.k»  m  m^^  +  B2.2  +  ®2.12* 

Thus  it  can  be  shown  that 


E[s(i-,j-)]  -  <J2(i-,i-)[ 


4n  -  7p  +  3m^  +  2m^  +  m^ 


16 


+  02<i-,j+)[ 


P  "  m4  "  3"  m3  “  ®2  +  ^m2  1 
_  ] 


2 

-  P  “  '  t  +  2m.  _ 

♦  d2(i+<j.,[ — 1 


p  “  m  —  —  in  —  m_  +  2in_  .  _ 

+  o2(i+,j+)[- - 1  -3-;6-2 - 2^2] 


Note  that  S(i-,j-)  will  be  unbiased  (up  to  a  scale  factor)  if  all  columns  of  %  are  in 
group  4  (p  =  m^,  m3  =  i"2  =  m^  =  0)  i.e.  for  each  variable  eliminated,  variables 

~i*k  ~j*k*  ~i*j*k  are  also  eliminated.  Similar  expressions  for  expectations  of 
S(i-,j+),  S(i+,j-)  and  S(i+,j+)  can  be  worked  out  quite  easily  from  the  above  formula 


by  switching  signs  on  i±,j± 
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